WikiTopics: What is Popular on Wikipedia and Why
Authors
Abstract
We establish a novel task in the spirit of news summarization and topic detection and tracking (TDT): daily determination of the topics newly popular with Wikipedia readers. Central to this effort is a new public dataset consisting of the hourly page view statistics of all Wikipedia articles over the last three years. We give baseline results for three tasks: discovering individual pages of interest, clustering these pages into coherent topics, and extracting the most relevant summarizing sentence for the reader. Compared against human judgements, our system shows the viability of this task and opens the door to a range of exciting future work.
Similar resources
Why We Read Wikipedia
Wikipedia is one of the most popular sites on the Web, with millions of users relying on it to satisfy a broad range of information needs every day. Although it is crucial to understand what exactly these needs are in order to be able to meet them, little is currently known about why users visit Wikipedia. The goal of this paper is to fill this gap by combining a survey of Wikipedia readers wit...
Comparing the usage of global and local Wikipedias with focus on Swedish Wikipedia
This report summarizes the results of a short-term student research project focused on the usage of Swedish Wikipedia. It is trying to answer the following question: To what extent (and why) do people from non-English language communities use the English Wikipedia instead of the one in their local language? Article access time series and article edit time series from major Wikipedias including ...
Searching Wikipedia: learning the why, the how, and the role played by emotion
Searching Wikipedia has been the focus of study for an increasing number of information retrieval publications. In recent years different IR tasks have used Wikipedia as a basis for evaluating algorithms and interfaces for various types of search tasks, including Question Answering, Exploratory Search, Entity Search and Structured Document retrieval. Despite being associated with these well-def...
Reflections on the Ontology of Mass Art as Avant-garde Art
The aesthetics of “everyday experience” is made by a paradigm that defines the duality of elite art/low-class art by focusing on the art of the majority/minority, based on the desire of the urban middle class. It has been put as the art of the people in opposition to the art of the avant-garde. The study is based on reflections on the possibility of finding the main origin between the art of the ma...
A Critical Study of the Views on the Why of Not Mentioning Uli `lamr as the Source of Conflict in the Verse 59 Nias
The present descriptive-analytic research has examined and interpreted the commentators' view on the why of not mentioning Uli `lamr as a source of conflict resolution in the verse 59 of Sura Nisa. In various respects, some have considered lack of innocence as the reason for not mentioning them as a source of dispute resolution, some view the people's obligation in following the first part of t...
Publication date: 2011